[V5] Return a BatchEncoding dict from apply_chat_template by default again #42567
Rocketknight1 merged 11 commits into main
Conversation
Flip the default return type for `apply_chat_template` to match the underlying tokenizer
[For maintainers] Suggested jobs to run (before merge): run-slow: blenderbot, bloom, cohere, gpt2, gpt_sw3

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
cc @LysandreJik - this was one of the V5 PRs before, do I need to do anything special with this one, or can we just merge it to main?
zucchini-nlp left a comment:
great, it was already approved once so lgtm 😄
    if not tokenize:
        return_dict = False  # dicts are only returned by the tokenizer anyway
Makes me wonder: do we need to support the combination of `tokenize=True, return_dict=False`, or can we deprecate/remove `return_dict` over time? I can't think of cases where users want a plain list of tokens as output.
Maybe we can get rid of it over time, but I think it's fine as a backward compatibility flag for now!
Sure, I meant after v5 plus several more minor releases, and if users are fine with it.
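For reference, a minimal sketch (my own, not code from this PR) of the `tokenize` / `return_dict` combinations being discussed, assuming a checkpoint whose tokenizer ships a chat template (the model name below is only an example):

```python
from transformers import AutoTokenizer

# assumed example checkpoint; any model with a chat template behaves the same way
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")
chat = [{"role": "user", "content": "Hello!"}]

# tokenize=False: a formatted string; return_dict is forced to False internally
text = tokenizer.apply_chat_template(chat, tokenize=False)

# tokenize=True with the new default return_dict=True: a BatchEncoding dict
encoding = tokenizer.apply_chat_template(chat)
print(encoding["input_ids"])

# tokenize=True, return_dict=False: the backward-compatibility path, a plain list of token ids
ids = tokenizer.apply_chat_template(chat, return_dict=False)
```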
[V5] Return a BatchEncoding dict from apply_chat_template by default again (huggingface#42567)

* Flip the default return type for `apply_chat_template` to match the underlying tokenizer
* Remove test_tokenization_for_chat tests, which no longer do anything useful
* Remove test_tokenization_for_chat tests, which no longer do anything useful
* Fix test_encode_message tests
* Fix test_encode_message tests
* nit fix
* Trigger tests
* Remove test_tokenization_for_chat
* make fixup
* Add a little test to make sure that doesn't happen again
* make fixup
This is basically PR #41626 again! Some of it got clobbered in the tokenizer refactor, but it's just as good the second time.
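As a rough sketch of the behaviour this restores (my own illustration, not code taken from the PR): with the dict default back, the output of `apply_chat_template` can be unpacked straight into `generate()`, just like the output of calling the tokenizer directly. The checkpoint name is again only an assumed example.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "Qwen/Qwen2.5-0.5B-Instruct"  # assumed example; any chat checkpoint works
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

chat = [{"role": "user", "content": "Write a haiku about tokenizers."}]

# With the restored default, this returns a BatchEncoding (input_ids, attention_mask, ...)
inputs = tokenizer.apply_chat_template(chat, add_generation_prompt=True, return_tensors="pt")

outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```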